Assessing naive Bayes and support vector machine performance in sentiment classification on a big data platform

نویسندگان

چکیده

<p><span lang="EN-US">Nowadays, mining user reviews becomes a very useful mean for decision making in several areas. Traditionally, machine learning algorithms have been widely and effectively used to analyze user’s opinions on limited volume of data. In the case massive data, powerful hardware resources (CPU, memory, storage) are essential dealing with whole data processing phases including, collection, pre-processing, an optimal time. Several big technologies emerged efficiently process like Apache Spark, which is distributed framework that provides libraries implementing algorithms. order evaluate performance Spark's library (MLlib) large classification accuracies time two implemented spark: naive </span><span>B</span><span lang="EN-US">ayes support vector (SVM) compared achieved by standard implementation these different size datasets built from movie reviews. The results our experiment show classifiers running under spark higher than traditional ones reaches F-measure greater 84%. At same time, we found framework, relatively low.</span></p>

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sentiment Analysis Technique: A Look into Support Vector Machine and Naive Bayes

Sentiment Analysis and opinion mining aims to analyze sentiments, opinions, emotions etc. towards products, services or current topics. There are various approaches applied to mine the sentiments portrayed. Supervised machine learning is one such approach that is generally applied. The aim of this paper is to investigate the current methods used to perform sentiment analysis by reviewing and co...

متن کامل

Naive-Bayes for Sentiment Classification

This report details the findings in building a naive Bayes sentiment classifier for a IMDB movie-review data set using Scala and ScalaNLP. We studied the unigram or bagof-words Bernoulli and Multinomial models and a number of different feature selection techniques, including term frequency, mutual information and Chi-squared. 1. DATA CORPUS The corpus contains of 2000 rated movie reviews, compr...

متن کامل

A Novel Technique for Fingerprint Classification based on Naive Bayes Classifier and Support Vector Machine

Fingerprint classification decreases the number of possible matches in automated fingerprint identification systems by categorizing fingerprints into predefined classes. Support vector machines are widely used in pattern classification and have produced high accuracy when performing fingerprint classification. In order to effectively apply Support vector machines to multi-class fingerprint clas...

متن کامل

Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data

Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...

متن کامل

Quantum support vector machine for big data classification.

Supervised machine learning is the classification of new data based on already classified training examples. In this work, we show that the support vector machine, an optimized binary classifier, can be implemented on a quantum computer, with complexity logarithmic in the size of the vectors and the number of training examples. In cases where classical sampling algorithms require polynomial tim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IAES International Journal of Artificial Intelligence

سال: 2021

ISSN: ['2089-4872', '2252-8938']

DOI: https://doi.org/10.11591/ijai.v10.i4.pp990-996